SEQUENCE LISTING 

(1) GENERAL INFORMATION:



(i) APPLICANT: SANTEN PHARMACEUTICAL CO., LTD. 

(ii) TITLE OF INVENTION: Novel Polypeptide Having Water Channel 
Activity and DNA sequence 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SANTEN PHARMACEUTICAL CO., LTD. 

(B) STREET: 9-19 Shimoshinjo 3-chome, Higashiyodogawa-Ku

(C) CITY: Osaka 

(D) STATE: Osaka 

(E) COUNTRY: JAPAN 
(F) ZIP: 533-0021

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 MB, storage 

(B) COMPUTER: IBM PS/2 or compatibles 

(C) OPERATING SYSTEM: WINDOWS 95/97 

(D) SOFTWARE: Microsoft Word 97 

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 09/381,810

(B) FILING DATE: 19-OCT-1999 

(C) CLASSIFICATION: 435 

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: JP09 094845 

(B) FILING DATE: 28-MAR-1997 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Burton A. Amernick 

(B) REGISTRATION NUMBER: 24852 

(C) REFERENCE/DOCKET NUMBER: 1581/00156 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202)331-7111

(B) FAX: (202)293-6229 

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Met Val Gln Ala Ser Gly His Arg Arg Ser Thr Arg Gly Ser Lys Met
1               5                   10                  15

Val Ser Trp Ser Val Ile Ala Lys Ile Gln Glu Ile Leu Gln Arg Lys
            20                  25                  30

Met Val Arg Glu Phe Leu Ala Glu Phe Met Ser Thr Tyr Val Met Met
        35                  40                  45

Val Phe Gly Leu Gly Ser Val Ala His Met Val Leu Asn Lys Lys Tyr
    50                  55                  60

Gly Ser Tyr Leu Gly Val Asn Leu Gly Phe Gly Phe Gly Val Thr Met
65                  70                  75                  80

Gly Val His Val Ala Gly Arg Ile Ser Gly Ala His Met Asn Ala Ala
                85                  90                  95

Val Thr Phe Ala Asn Cys Ala Leu Gly Arg Val Pro Trp Arg Lys Phe
            100                 105                 110

Pro Val Tyr Val Leu Gly Gln Phe Leu Gly Ser Phe Leu Ala Ala Ala
        115                 120                 125

Thr Ile Tyr Ser Leu Phe Tyr Thr Ala Ile Leu His Phe Ser Gly Gly
    130                 135                 140

Gln Leu Met Val Thr Gly Pro Val Ala Thr Ala Gly Ile Phe Ala Thr
145                 150                 155                 160

Tyr Leu Pro Asp His Met Thr Leu Trp Arg Gly Phe Leu Asn Glu Ala
                165                 170                 175

Trp Leu Thr Gly Met Leu Gln Leu Cys Leu Phe Ala Thr Thr Asp Gln
            180                 185                 190

Glu Asn Asn Pro Ala Leu Pro Gly Thr Glu Ala Leu Val Ile Gly Ile
        195                 200                 205

Leu Val Val Ile Ile Gly Val Ser Leu Gly Met Asn Thr Gly Tyr Ala
    210                 215                 220

Ile Asn Pro Ser Arg Asp Leu Pro Pro Arg Ile Phe Thr Phe Ile Ala
225                 230                 235                 240

Gly Trp Gly Lys Gln Val Phe Ser Asn Gly Glu Asn Trp Trp Trp Val
                245                 250                 255

Pro Val Val Ala Pro Leu Leu Gly Ala Tyr Leu Gly Gly Ile Ile Tyr
            260                 265                 270

Leu Val Phe Ile Gly Ser Thr Ile Pro Arg Glu Pro Leu Lys Leu Glu
        275                 280                 285

Asp Ser Val Ala Tyr Glu Asp His Gly Ile Thr Val Leu Pro Lys Met
    290                 295                 300

Gly Ser His Glu Pro Thr Ile Ser Pro Leu Thr Pro Val Ser Val Ser
305                 310                 315                 320

Pro Ala Asn Arg Ser Ser Val His Pro Ala Pro Pro Leu His Glu Ser
                325                 330                 335

Met Ala Leu Glu His Phe
            340






(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1258 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA to mRNA
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) TISSUE TYPE: fat tissue 
(ix) FEATURE: 

(A) NAME/KEY: exon

(B) LOCATION: 173..1198

(C) IDENTIFICATION METHOD: by experiment 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GGCTCTGGAC TGGGGACACA GGGATAGCTG AGCCCCAGCT GGGGGTGGAA GCTGAGCCAG   60
GGACAGTCAC GGAGGAACAA GATCAAGATG CGCTGTAACT GAGAAGCCCC CAAGGCGGAG  120
GCTGAGAATC AGAGACATTT CAGCAGACAT CTACAAATCT GAAAGACAAA AC ATG GTT  178
                                                          Met Val
                                                          1

CAA GCA TCC GGG CAC AGG CGG TCC ACC CGT GGC TCC AAA ATG GTC TCC    226
Gln Ala Ser Gly His Arg Arg Ser Thr Arg Gly Ser Lys Met Val Ser
        5                   10                  15

TGG TCC GTG ATA GCA AAG ATC CAG GAA ATA CTG CAG AGG AAG ATG GTG    274
Trp Ser Val Ile Ala Lys Ile Gln Glu Ile Leu Gln Arg Lys Met Val
    20                  25                  30

CGA GAG TTC CTG GCC GAG TTC ATG AGC ACA TAT GTC ATG ATG GTA TTC    322
Arg Glu Phe Leu Ala Glu Phe Met Ser Thr Tyr Val Met Met Val Phe
35                  40                  45                  50

GGC CTT GGT TCC GTG GCC CAT ATG GTT CTA AAT AAA AAA TAT GGG AGC    370
Gly Leu Gly Ser Val Ala His Met Val Leu Asn Lys Lys Tyr Gly Ser
                55                  60                  65

TAC CTT GGT GTC AAC TTG GGT TTT GGC TTC GGA GTC ACC ATG GGA GTG    418
Tyr Leu Gly Val Asn Leu Gly Phe Gly Phe Gly Val Thr Met Gly Val
            70                  75                  80

CAC GTG GCA GGC CGC ATC TCT GGA GCC CAC ATG AAC GCA GCT GTG ACC    466
His Val Ala Gly Arg Ile Ser Gly Ala His Met Asn Ala Ala Val Thr
        85                  90                  95

TTT GCT AAC TGT GCG CTG GGC CGC GTG CCC TGG AGG AAG TTT CCG GTC    514
Phe Ala Asn Cys Ala Leu Gly Arg Val Pro Trp Arg Lys Phe Pro Val
    100                 105                 110

TAT GTG CTG GGG CAG TTC CTG GGC TCC TTC CTG GCG GCT GCC ACC ATC    562
Tyr Val Leu Gly Gln Phe Leu Gly Ser Phe Leu Ala Ala Ala Thr Ile
115                 120                 125                 130

TAC AGT CTC TTC TAC ACG GCC ATT CTC CAC TTT TCG GGT GGA CAG CTG    610
Tyr Ser Leu Phe Tyr Thr Ala Ile Leu His Phe Ser Gly Gly Gln Leu
                135                 140                 145

ATG GTG ACC GGT CCC GTC GCT ACA GCT GGC ATT TTT GCC ACC TAC CTT    658
Met Val Thr Gly Pro Val Ala Thr Ala Gly Ile Phe Ala Thr Tyr Leu
            150                 155                 160

CCT GAT CAC ATG ACA TTG TGG CGG GGC TTC CTG AAT GAG GCG TGG CTG    706
Pro Asp His Met Thr Leu Trp Arg Gly Phe Leu Asn Glu Ala Trp Leu
        165                 170                 175

ACC GGG ATG CTC CAG CTG TGT CTC TTC GCC ATC ACG GAC CAG GAG AAC    754
Thr Gly Met Leu Gln Leu Cys Leu Phe Ala Thr Thr Asp Gln Glu Asn
    180                 185                 190

AAC CCA GCA CTG CCA GGA ACA GAG GCG CTG GTG ATA GGC ATC CTC GTG    802
Asn Pro Ala Leu Pro Gly Thr Glu Ala Leu Val Ile Gly Ile Leu Val
195                 200                 205                 210

GTC ATC ATC GGG GTG TCC CTT GGC ATG AAC ACA GGA TAT GCC ATC AAC    850
Val Ile Ile Gly Val Ser Leu Gly Met Asn Thr Gly Tyr Ala Ile Asn
                215                 220                 225

CCG TCC CGG GAC CTG CCC CCC CGC ATC TTC ACC TTC ATT GCT GGT TGG    898
Pro Ser Arg Asp Leu Pro Pro Arg Ile Phe Thr Phe Ile Ala Gly Trp
            230                 235                 240

GGC AAA CAG GTC TTC AGC AAT GGG GAG AAC TGG TGG TGG GTG CCA GTG    946
Gly Lys Gln Val Phe Ser Asn Gly Glu Asn Trp Trp Trp Val Pro Val
        245                 250                 255

GTG GCA CCA CTT CTG GGT GCC TAT CTA GGT GGC ATC ATC TAC CTG GTC    994
Val Ala Pro Leu Leu Gly Ala Tyr Leu Gly Gly Ile Ile Tyr Leu Val
    260                 265                 270

TTC ATT GGC TCC ACC ATC CCA CGG GAG CCC CTG AAA TTG GAG GAT TCT   1042
Phe Ile Gly Ser Thr Ile Pro Arg Glu Pro Leu Lys Leu Glu Asp Ser
275                 280                 285                 290

GTG GCG TAT GAA GAC CAC GGG ATA ACC GTA TTG CCC AAG ATG GGA TCT   1090
Val Ala Tyr Glu Asp His Gly Ile Thr Val Leu Pro Lys Met Gly Ser
                295                 300                 305

CAT GAA CCC ACG ATC TCT CCC CTC ACC CCC GTC TCT GTG AGC CCT GCC   1138
His Glu Pro Thr Ile Ser Pro Leu Thr Pro Val Ser Val Ser Pro Ala
            310                 315                 320



AAC AGA TCT TCA GTC CAC CCT GCC CCA CCC TTA CAT GAA TCC ATG GCC   1186
Asn Arg Ser Ser Val His Pro Ala Pro Pro Leu His Glu Ser Met Ala
        325                 330                 335

CTA GAG CAC TTC TAAGCAGAGA TTATTTGTGA TCCCATCCAT TCCCCAATAA       1238
Leu Glu His Phe
    340

AGCAAGGCTT GTCCGACAAA                                             1258






